Abstract. Weak gravitational lensing is considered to be one of the most powerful tools to study the mass and
the mass distribution of galaxy clusters. However, weak lensing mass reconstructions are plagued by the so-called
mass-sheet degeneracy: the surface mass density $\kappa$ of the cluster can be determined only up to a degeneracy
transformation $\kappa \to \kappa' = \lambda\kappa + (1 - \lambda)$, where $\lambda$ is an arbitrary constant. This transformation fundamentally limits
the accuracy of cluster mass determinations if no further assumptions are made. We describe here a method 
to break the mass-sheet degeneracy in weak lensing mass maps using distortion and redshift information of 
background galaxies and illustrate this by two simple toy models. Compared to other techniques proposed in 
the past, it does not rely on any assumptions about the cluster potential; it can be easily applied to non-parametric
mass reconstructions, and no assumptions about boundary conditions have to be made. In addition, it does not make
use of weakly constrained information (such as the source number counts, used in the magnification effect). Our
simulations show that we are effectively able to break the mass-sheet degeneracy for supercritical lenses, but that
for undercritical lenses the mass-sheet degeneracy is very difficult to break, even under idealised conditions.
1. Introduction

Galaxy clusters have been the focus of very intense ongoing
research in recent years. They are the most massive
bound structures in the Universe; moreover, their large
dynamical time scale allows them to retain the information
about their formation history. The mass and the mass
distribution of galaxy clusters are particularly important for
cosmological studies and therefore provide a critical test
of the Cold Dark Matter (CDM) paradigm.

Weak gravitational lensing techniques are a unique
tool to measure the projected mass of a cluster. This effect
has long been recognized and measured (see Tyson et al.
1990); however, it was only after the pioneering work of
Kaiser & Squires (1993; see e.g. Clowe & Schneider 2001,
2002 for further applications) that the field began to
flourish. Since then several other mass reconstruction
techniques and applications of weak lensing have been
developed (see e.g. Kaiser et al. 1995; Lombardi & Bertin 1999;
Seitz & Schneider 1997; Bridle et al. 1998; Marshall et al.
2002; Hoekstra et al. 1998).
Unfortunately, all these methods suffer from the fact
that the projected surface mass density $\kappa$ can be
determined only up to a degeneracy transformation
$\kappa \to \kappa' = \lambda\kappa + (1 - \lambda)$, where $\lambda$ is an arbitrary constant. This
invariance (first recognized by Falco et al. 1985 in the context
of strong lensing and by Schneider & Seitz 1995 in the
context of weak lensing) leaves the relation between the
intrinsic and observed ellipticity unchanged. Therefore it
cannot be broken by using only measurements of the
distortion of the background sources if these all lie at the
same distance from the observer, and it fundamentally limits
the accuracy of cluster mass determinations if no further
assumptions are made.

A naive solution to this problem is to constrain $\lambda$ by
making simple assumptions about $\kappa$. For example, one can
assume that the surface mass density is decreasing with
distance from the centre, implying $\lambda > 0$. In addition, $\kappa$ is
likely to be non-negative, and so one can obtain an upper
limit on $\lambda$ (for $\kappa < 1$).

More quantitatively, with the use of wide field
cameras one might try to assume that $\kappa \simeq 0$ at the boundary
of the field, far away from the cluster center. However,
if we consider for example a $M_{\rm vir} = 10^{15}\,M_\odot$ cluster at
redshift $z = 0.2$, we expect from N-body simulations to
have a projected dimensionless density of about $\kappa \simeq 0.005$
at 15 arcmin from the cluster center (Douglas Clowe,
private communication). Hence, even with the use of a
$30 \times 30$ arcmin camera we expect to underestimate the
virial mass of such a cluster by $\sim 20\%$.

Another approach to determine the mass of a cluster
from weak lensing data is to fit the shear signal with
a parametric model (such as SIS or NFW). However, as
shown in Clowe & Schneider (2001, 2002), it is extremely
difficult to distinguish SIS from NFW at high significance,
and thus the mass estimates become difficult. They
conclude that one of the main sources of uncertainty in
determining the cluster mass and profile is the
mass-sheet degeneracy.

Unfortunately, the mass-sheet degeneracy cannot be
lifted by using the shapes of the background sources alone
(Seitz & Schneider 1997), and so one needs to make use
of additional information. One promising solution to this
problem is provided by the so-called magnification effect
(see Broadhurst et al. 1995). Indeed, the local number
counts of background sources are directly related to the
magnification of the lens, which to first order is given
by $1 + 2\kappa$. As a result, a careful estimate of the
local density of background sources can lead to a direct
measurement of the projected density $\kappa$ (see Fort et al.
1997; Taylor et al. 1998 for reported detections of this
effect). In addition, other methods to measure cluster
masses via magnification effects have been proposed (e.g.
Bartelmann & Narayan 1995 suggested using the sizes of
galaxies at a given surface brightness as a measure of the
local lens magnification). All these methods, however,
require a fine calibration of external parameters, such as the
number counts of unlensed sources, which is very difficult
to obtain with the desired accuracy (Schneider et al. 2000).



In this work we will therefore focus on the possible
methods to break the mass-sheet degeneracy by using
distortion and redshift information of background sources.
Seitz & Schneider (1997) already considered this problem,
and they showed that the mass-sheet degeneracy can be
weakly broken provided that the probability distribution
of the redshifts for the observed sources is known with
good accuracy. Recently, it has been shown that
photometric redshifts based on accurate multi-band photometry
can be extremely accurate (see e.g. Benítez 2000). Hence,
it is sensible to assume that the individual redshifts of the
background sources can be known in weak lensing studies.
As we will show later, this in principle allows one to break
the mass-sheet degeneracy.

This paper is organised as follows. In Sect. 2 we give
the basic lens equations, introduce the notation we use,
and introduce the problem of mass-sheet degeneracy. In
Sect. 3 we focus on the cluster mass reconstruction
technique, and we describe the problem of mass-sheet
degeneracy in more detail in Sect. 4. In Sect. 5 we describe the
simulated data we are using and present the results. We
conclude in Sect. 6.



2. Statistical lensing preliminaries 

2.1. Notation and basic lens relationships 

In this paper we use the standard weak lensing notation as
described in Bartelmann & Schneider (2001). We consider
a lens with projected surface mass density $\Sigma(\theta)$, where $\theta$
denotes the position in the lens plane. For a source at
redshift $z$ and a lens at redshift $z_{\rm d}$ we define the critical
surface mass density $\Sigma_{\rm cr}(z)$ as

$$\Sigma_{\rm cr}(z) = \frac{c^2}{4\pi G}\,\frac{D_{\rm s}}{D_{\rm d}\,D_{\rm ds}} , \tag{1}$$

where $D$ represents the angular diameter distances; $D_{\rm s} =
D(0, z)$, $D_{\rm d} = D(0, z_{\rm d})$, and $D_{\rm ds} = D(z_{\rm d}, z)$ are the
angular diameter distances between the observer and the
source, the observer and the lens, and the lens and the
source, respectively.

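Since we will work with sources at different redshifts,
we factorize the redshift dependence of the lens
convergence $\kappa$ and of the shear $\gamma$ as

$$\kappa(\theta, z) = Z(z)\,\kappa(\theta) , \qquad \gamma(\theta, z) = Z(z)\,\gamma(\theta) , \tag{2}$$

where $Z(z)$ is the so-called "cosmological weight"
function:

$$Z(z) = \frac{\lim_{z \to \infty} \Sigma_{\rm cr}(z_{\rm d}, z)}{\Sigma_{\rm cr}(z_{\rm d}, z)}\,{\rm H}(z - z_{\rm d}) . \tag{3}$$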

The Heaviside step function ${\rm H}(z - z_{\rm d})$ accounts for the
fact that the sources that are closer to the observer than
the deflector are not lensed. Note that, as suggested by its
name, $Z(z)$ is cosmology dependent. For an Einstein-de
Sitter cosmology it becomes

$$Z(z) = \frac{\sqrt{1 + z} - \sqrt{1 + z_{\rm d}}}{\sqrt{1 + z} - 1}\,{\rm H}(z - z_{\rm d}) . \tag{4}$$
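
As a quick numerical illustration of Eq. (4), the following Python sketch evaluates the Einstein-de Sitter cosmological weight for a lens at $z_{\rm d} = 0.2$ (the lens redshift adopted in the simulations of Sect. 5). The function name `cosmological_weight_eds` and the sample redshifts are ours and serve only as an illustration.

```python
import math

def cosmological_weight_eds(z, z_d):
    """Cosmological weight Z(z) of Eq. (4), Einstein-de Sitter cosmology."""
    if z <= z_d:
        # Heaviside factor: sources in front of the lens are not lensed.
        return 0.0
    return (math.sqrt(1.0 + z) - math.sqrt(1.0 + z_d)) / (math.sqrt(1.0 + z) - 1.0)

# Illustrative source redshifts (our choice), lens at z_d = 0.2.
for z in (0.1, 0.3, 0.5, 1.0, 2.0, 5.0):
    print(f"z = {z:3.1f}   Z = {cosmological_weight_eds(z, 0.2):.3f}")
```

The weight rises steeply just behind the lens and saturates towards unity at high redshift, which is why the spread of $Z$ values among the sources matters for the method discussed below.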



In Lombardi & Bertin (1999) cosmological weights were
calculated for different cosmologies. The authors have
shown that the differences between Einstein-de Sitter and
the nowadays assumed standard cosmology (i.e. $\Omega_{\rm m} = 0.3$,
$\Omega_\Lambda = 0.7$) are not significant for the purpose of
cluster-mass reconstructions. Therefore we will from now on use
an Einstein-de Sitter cosmology. Similarly to $\kappa$ and $\gamma$, we
define the redshift-dependent reduced shear $g(\theta, z)$ as

$$g(\theta, z) = \frac{Z(z)\,\gamma(\theta)}{1 - Z(z)\,\kappa(\theta)} . \tag{5}$$

Throughout this paper we use the complex ellipticity
$\epsilon$ defined in terms of the second brightness moments $Q_{ij}$
as

$$\epsilon = \frac{Q_{11} - Q_{22} + 2{\rm i}\,Q_{12}}{Q_{11} + Q_{22} + 2\left(Q_{11} Q_{22} - Q_{12}^2\right)^{1/2}} . \tag{6}$$

The transformation between the source ellipticity $\epsilon^{\rm s}$ and
image ellipticity $\epsilon$ is given as a function of reduced shear
$g(\theta, z)$ (see Seitz & Schneider 1997),

$$\epsilon^{\rm s} = \begin{cases}
\dfrac{\epsilon - g(\theta, z)}{1 - g^*(\theta, z)\,\epsilon} & \text{for } |g(\theta, z)| < 1 , \\[2ex]
\dfrac{1 - g(\theta, z)\,\epsilon^*}{\epsilon^* - g^*(\theta, z)} & \text{for } |g(\theta, z)| > 1 .
\end{cases} \tag{7}$$

The inverse transformation is simply given by swapping
$\epsilon^{\rm s}$ with $\epsilon$ and by substituting $-g$ for $g$.

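Under the assumption that the intrinsic ellipticity
distribution is isotropic, $\langle \epsilon^{\rm s} \rangle = 0$, the expectation value for
the image ellipticity at redshift $z$ becomes

$$\langle \epsilon(z) \rangle = \begin{cases}
g(\theta, z) & \text{if } |g(\theta, z)| < 1 , \\[1ex]
\dfrac{1}{g^*(\theta, z)} & \text{otherwise.}
\end{cases} \tag{8}$$

In the weak lensing approximation, which we define by
$\kappa \ll 1$, $|\gamma| \ll 1$ (thus $|g| \ll 1$), the expectation value is
given by $\langle \epsilon(z) \rangle = \gamma(\theta, z)$.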

2.2. The problem of the mass sheet degeneracy 

In the simple case of background sources all having the
same redshift, the mass-sheet degeneracy can be
understood just using the above equations. Indeed, consider for
a moment the transformation of the potential $\psi$

$$\psi(\theta, z) \to \psi'(\theta, z) = \frac{1 - \lambda}{2}\,\theta^2 + \lambda\,\psi(\theta, z) , \tag{9}$$

where $\lambda$ is an arbitrary constant. $\kappa$ and $\gamma$ are related to the
potential $\psi$ through its second partial derivatives (denoted
by subscript), namely

$$\kappa = \frac{1}{2}\left(\psi_{,11} + \psi_{,22}\right) , \qquad \gamma_1 = \frac{1}{2}\left(\psi_{,11} - \psi_{,22}\right) , \qquad \gamma_2 = \psi_{,12} . \tag{10}$$

From Eqs. (9) and (10) it follows that $\kappa$ transforms as

$$\kappa(\theta, z) \to \kappa'(\theta, z) = \lambda\,\kappa(\theta, z) + (1 - \lambda) , \tag{11}$$

and similarly the shear changes as $\gamma(\theta, z) \to \lambda\,\gamma(\theta, z)$.
Therefore the reduced shear $g(\theta, z)$ remains invariant.

Seitz & Schneider (1997) have shown that in the case
of a known redshift distribution, a similar form of the
mass-sheet degeneracy holds to a very good approximation for
non-critical clusters, i.e. for clusters with $|g(\theta, z)| < 1$ for
all source redshifts $z$. In such a case the standard
weak-lensing mass reconstruction is affected by the degeneracy

$$\kappa \to \kappa' \simeq \lambda\,\kappa + \frac{(1 - \lambda)\,\langle Z(z) \rangle}{\langle Z^2(z) \rangle} , \tag{12}$$

where $\langle Z^n(z) \rangle$ denotes the $n$-th order moment of the
distribution of cosmological weights. As a result, standard
weak-lensing reconstructions are still affected by the
mass-sheet degeneracy even for sources at different redshifts;
moreover, simulations show that the degeneracy is hardly
broken even for lenses close to critical.

In this paper, we use the information of individual
redshifts of background sources to break this degeneracy. As
an illustration of the effect, suppose that half of the
background sources are located at a known redshift $z^{(1)}$ and
the other half at another known redshift $z^{(2)}$. Then, the
weak lensing reconstructions based on the two
populations will provide two different mass maps, $\kappa'(\theta, z^{(1)})$ and
$\kappa'(\theta, z^{(2)})$, leading to two different forms of the mass-sheet
degeneracy. In other words, the two mass reconstructions
($i = 1, 2$) are given by

$$\kappa'(\theta, z^{(i)}) = \lambda^{(i)}\,\kappa_{\rm t}(\theta, z^{(i)}) + (1 - \lambda^{(i)}) , \tag{13}$$

where we have denoted by $\kappa_{\rm t}(\theta, z^{(i)})$ the true projected $\kappa$ of
the lens at the angular position $\theta$ for sources at redshift
$z^{(i)}$. Since the transformation (13) holds for any $\theta$, we
have a system of equations to be solved for $\lambda^{(1)}$ and $\lambda^{(2)}$.
The relation between $\kappa_{\rm t}(\theta, z^{(1)})$ and $\kappa_{\rm t}(\theta, z^{(2)})$ is known;
namely, from Eq. (2) it follows that

$$\kappa_{\rm t}(\theta, z^{(1)})\,Z(z^{(2)}) = \kappa_{\rm t}(\theta, z^{(2)})\,Z(z^{(1)}) . \tag{14}$$

Suppose one measures both $\kappa'(\theta, z^{(i)})$ at $N$ different
positions $\theta_j$; this gives us a system of $2N$ equations to be
solved for $\lambda^{(i)}$ and $\kappa_{\rm t}(\theta_j)$. The mass-sheet degeneracy is
therefore at least in theory lifted.
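
The two-population argument can be made concrete with a small noise-free sketch. Writing $\kappa_{\rm t}(\theta, z^{(i)}) = Z(z^{(i)})\,\kappa_{\rm t}(\theta)$ as in Eq. (2) and eliminating $\kappa_{\rm t}(\theta_j)$ between the two copies of Eq. (13), the second map becomes a linear function of the first, and its slope and intercept fix $\lambda^{(1)}$ and $\lambda^{(2)}$. The code below is only an illustration under idealised assumptions (no noise, exactly two source planes with different $Z$, a convergence that varies across the positions, and hypothetical values for $Z$ and $\lambda$); the function names are ours, and this is not the likelihood method used in the rest of the paper.

```python
def fit_line(x, y):
    """Ordinary least-squares slope and intercept of y against x."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x

def solve_two_planes(kappa1_obs, kappa2_obs, Z1, Z2):
    """Recover lambda^(1), lambda^(2) and kappa_t(theta_j) from Eq. (13).

    With kappa_i' = lambda_i Z_i kappa_t + (1 - lambda_i), eliminating kappa_t
    gives kappa_2' = a kappa_1' + b, where a = lambda_2 Z2 / (lambda_1 Z1) and
    b = a (lambda_1 - 1) + (1 - lambda_2).  Requires Z1 != Z2.
    """
    a, b = fit_line(kappa1_obs, kappa2_obs)
    lam1 = (a + b - 1.0) / (a * (1.0 - Z1 / Z2))
    lam2 = a * lam1 * Z1 / Z2
    kappa_t = [(k1 - (1.0 - lam1)) / (lam1 * Z1) for k1 in kappa1_obs]
    return lam1, lam2, kappa_t

# Hypothetical toy input: true convergence at N = 4 positions, two source planes.
Z1, Z2 = 0.5, 0.8
kappa_true = [0.2, 0.4, 0.6, 0.9]
lam1_true, lam2_true = 0.8, 1.3
map1 = [lam1_true * Z1 * k + (1.0 - lam1_true) for k in kappa_true]
map2 = [lam2_true * Z2 * k + (1.0 - lam2_true) for k in kappa_true]

lam1, lam2, kappa_rec = solve_two_planes(map1, map2, Z1, Z2)
print(lam1, lam2, kappa_rec)
```

In this idealised setting the input values are recovered to rounding error. With noisy maps the denominator $1 - Z^{(1)}/Z^{(2)}$ amplifies the errors when the two weights are similar.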

It is interesting to observe that this argument only
applies to relatively "strong" lenses. Indeed, for "weak"
lenses, i.e. lenses for which we can use a first order
approximation in $\kappa$ and $\gamma$, the expectation value of
measured image ellipticities is $\langle \epsilon(z) \rangle = \gamma(\theta, z)$. In such a case
the degeneracy of the form

$$\psi(\theta, z) \to \psi'(\theta, z) = \frac{1 - \lambda}{2}\,\theta^2 + \psi(\theta, z) \tag{15}$$

leaves the observable $\gamma(\theta, z)$ unchanged. As a result, the
method described above cannot be used to break the
mass-sheet degeneracy for these lenses. Only when the
$(1 - Z(z)\kappa)$ term in the reduced shear becomes
important and $g(\theta, z)$ can be distinguished from $\gamma(\theta, z)$ in the
(noisy) data will we be able to make unbiased cluster
mass reconstructions.

The aperture mass measures (Kaiser 1995; Schneider
1996) are very convenient for measuring the weak lensing
signal. A simple form of such measures is the so-called $\zeta$
statistics,

$$\zeta(\theta; \vartheta_1, \vartheta_2) = \bar\kappa(\theta; \vartheta_1) - \bar\kappa(\theta; \vartheta_1, \vartheta_2) , \tag{16}$$

which gives the difference between the mean surface mass
densities in a circle of radius $\vartheta_1$ at position $\theta$ and in an
annulus between the radii $\vartheta_1$ and $\vartheta_2$. It measures the mass
contrast, and thus cannot be used to determine the
absolute mass scale. Therefore, despite the fact that it is not
affected by the mass-sheet degeneracy transformation of
the form (15), it will not be discussed further in this work.

In conclusion, using individual redshifts of background 
sources provides the most direct method to break the 
mass-sheet degeneracy. The simple argument sketched 
above for two redshift planes will therefore be considered 
in more detail in this paper and generalised to background 
galaxies at different redshifts. In the following we will use 
mock catalogues of image ellipticities, generated under op- 
timistic assumptions. Such an approach will allow us to 
investigate the minimum lensing strength needed to be 
able to lift the degeneracy in the forthcoming observa- 
tions. As we will see below, this straightforward method 
can be successfully used to break the mass-sheet degener- 
acy in statistical lensing when applied in the strong lensing 
regime. 
3. Cluster mass reconstruction from image distortion

3.1. Maximum-likelihood approach 

In order to further investigate the effect of using individual 
redshifts in the cluster mass reconstructions, it is useful 
to adopt a Bayesian approach and to write the likelihood 
function of a given observed configuration of galaxy ellip- 
ticities. 

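Let us call $p_{\epsilon^{\rm s}}(\epsilon^{\rm s})$ the probability distribution of
intrinsic source ellipticities. For simplicity and better
understanding we will here (and throughout this paper) assume
the following truncated Gaussian distribution

$$p_{\epsilon^{\rm s}}(\epsilon^{\rm s}) = \frac{1}{2\pi\sigma^2\left[1 - \exp\left(-1/(2\sigma^2)\right)\right]}\,\exp\left(-|\epsilon^{\rm s}|^2/(2\sigma^2)\right) . \tag{17}$$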

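Note however that the results in this section are
independent of the particular choice for $p_{\epsilon^{\rm s}}$. The observed
distribution $p_\epsilon(\epsilon \mid g)$ is related to the intrinsic one by (see
Geiger & Schneider 1998)

$$p_\epsilon(\epsilon \mid g) = p_{\epsilon^{\rm s}}\bigl(\epsilon^{\rm s}(\epsilon \mid g)\bigr)\,\left|\frac{\partial^2 \epsilon^{\rm s}}{\partial^2 \epsilon}\right|(\epsilon \mid g) . \tag{18}$$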



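The Jacobian determinant from the previous equation is
given by

$$\left|\frac{\partial^2 \epsilon^{\rm s}}{\partial^2 \epsilon}\right|(\epsilon \mid g) = \begin{cases}
\dfrac{\left(|g|^2 - 1\right)^2}{\left|\epsilon g^* - 1\right|^4} & \text{for } |g| < 1 , \\[2ex]
\dfrac{\left(|g|^2 - 1\right)^2}{\left|\epsilon - g\right|^4} & \text{for } |g| > 1 .
\end{cases} \tag{19}$$

In the weak lensing limit, $|g| \ll 1$, the Jacobian
determinant is simply unity.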
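
The ellipticity transformation of Eq. (7), its inverse, and the Jacobian of Eq. (19) can be checked against each other numerically. The sketch below is a minimal illustration with function names of our own choosing: it implements both branches with Python complex numbers and compares the analytic Jacobian with a finite-difference estimate of the area ratio of the map $\epsilon \to \epsilon^{\rm s}$.

```python
def source_from_image(eps, g):
    """Eq. (7): source ellipticity for image ellipticity eps and reduced shear g."""
    if abs(g) < 1.0:
        return (eps - g) / (1.0 - g.conjugate() * eps)
    return (1.0 - g * eps.conjugate()) / (eps.conjugate() - g.conjugate())

def image_from_source(eps_s, g):
    """Inverse of Eq. (7): swap eps_s with eps and substitute -g for g."""
    if abs(g) < 1.0:
        return (eps_s + g) / (1.0 + g.conjugate() * eps_s)
    return (1.0 + g * eps_s.conjugate()) / (eps_s.conjugate() + g.conjugate())

def jacobian(eps, g):
    """Eq. (19): Jacobian determinant |d^2 eps_s / d^2 eps|."""
    num = (abs(g) ** 2 - 1.0) ** 2
    if abs(g) < 1.0:
        return num / abs(eps * g.conjugate() - 1.0) ** 4
    return num / abs(eps - g) ** 4

def jacobian_numeric(eps, g, h=1e-6):
    """Finite-difference estimate of the same determinant (for checking only)."""
    d_re = (source_from_image(eps + h, g) - source_from_image(eps - h, g)) / (2 * h)
    d_im = (source_from_image(eps + 1j * h, g) - source_from_image(eps - 1j * h, g)) / (2 * h)
    # Determinant of the real 2x2 matrix built from the two directional derivatives.
    return abs(d_re.real * d_im.imag - d_re.imag * d_im.real)

eps = 0.3 + 0.1j                      # illustrative image ellipticity
for g in (0.4 - 0.2j, 1.5 + 0.5j):    # one undercritical, one overcritical value
    print(abs(g), jacobian(eps, g), jacobian_numeric(eps, g))
```

For $g = 0$ the analytic expression reduces to unity, as stated above for the weak lensing limit.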

In general, the measured ellipticity $\epsilon^{\rm m}$ will differ from
$\epsilon$ because of measurement errors:

$$\epsilon^{\rm m} = \epsilon + \epsilon^{\rm err} . \tag{20}$$

The error $\epsilon^{\rm err}$ is a random variable whose distribution
depends on the details of the ellipticity measurement
algorithm. In the following, for simplicity (and throughout this
paper), we will assume that $\epsilon^{\rm err}$ is distributed according
to a Gaussian distribution with dispersion $\sigma_{\rm err}$. The
actual shape measurement which corrects for PSF smearing
(see e.g. Kaiser et al. 1995) can yield measured ellipticities
with $|\epsilon^{\rm m}| > 1$.

In our case, we write the probability distribution for
the measured ellipticities as

$$p_{\epsilon^{\rm m}}(\epsilon^{\rm m} \mid g) = \int p_\epsilon(\epsilon \mid g)\,p_{\epsilon^{\rm err}}(\epsilon - \epsilon^{\rm m})\,{\rm d}^2\epsilon , \tag{21}$$

where $p_{\epsilon^{\rm err}}(\epsilon^{\rm err})$ is the probability distribution of
measurement errors. This convolution takes into account the
above-mentioned measurement errors, and therefore we do not
need to discard galaxies with $|\epsilon^{\rm m}| > 1$.

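The likelihood function $\mathcal{L}$ is the product of probability
densities for observed image ellipticities $\epsilon_i^{\rm m}$, and depends
on the model parameters $\pi$ through the reduced shear at
the image positions $\{g_i\}$:

$$\mathcal{L}(\pi) = \prod_{i=1}^{N_{\rm g}} p_{\epsilon^{\rm m}}(\epsilon_i^{\rm m} \mid g_i) . \tag{22}$$

Here $N_{\rm g}$ is the number of observed galaxies with measured
ellipticities $\epsilon_i^{\rm m}$. It is more convenient to deal with the
log-likelihood function, defined as

$$l(\pi) \equiv -\ln \mathcal{L}(\pi) = -\sum_{i=1}^{N_{\rm g}} \ln p_{\epsilon^{\rm m}}(\epsilon_i^{\rm m} \mid g_i) . \tag{23}$$

By minimising $l(\pi)$ we obtain the most likely parameters
$\pi_{\rm max}$ given the observations.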

For each of the $N_{\rm g}$ galaxies we need to calculate the
probability distribution $p_{\epsilon^{\rm m}}(\epsilon^{\rm m} \mid g)$ [cf. Eq. (21)] of
measured image ellipticity $\epsilon_i^{\rm m}$ given the reduced shear $g$. The
evaluation of the full likelihood function is therefore
non-trivial, since $N_{\rm g}$ integrals need to be calculated. In
simple words, we are dealing with two Gaussian probability
distributions, for the source ellipticities $\epsilon^{\rm s}$ and for the
measurement error $\epsilon^{\rm err}$. However, the measured
image ellipticity $\epsilon^{\rm m}$ is in general not distributed
according to a Gaussian, since the source ellipticities
are first lensed and only then are the measurement
errors added (i.e. the Jacobian determinant in (18) is
not unity).

If the reduced shear is small (i.e. for undercritical
lenses), this problem can be solved by including the
measurement errors in the source probability distribution $p_{\epsilon^{\rm s}}$:
in other words, if $|g| \ll 1$, the Jacobian determinant is
unity and $p_\epsilon(\epsilon \mid g) = p_{\epsilon^{\rm s}}(\epsilon^{\rm s}(\epsilon \mid g))$. We can therefore
interchange the convolution with $p_{\epsilon^{\rm err}}$ appearing in Eq. (21)
with the lensing transformation of Eq. (18). The
calculations are then trivial, since $p_{\epsilon^{\rm s}}$ and $p_{\epsilon^{\rm err}}$ can be taken to
be Gaussians with dispersions $\sigma_{\epsilon^{\rm s}}$ and $\sigma_{\rm err}$ respectively;
hence, we can just use for $p_{\epsilon^{\rm s}}$ a Gaussian with dispersion
$\sigma = \sqrt{\sigma_{\epsilon^{\rm s}}^2 + \sigma_{\rm err}^2}$.

This is, however, not the case for critical lenses. As we
show later, the approximate form for the measured
ellipticities [cf. Eq. (18)] gives biased results for lenses with
a large fraction of background sources having $|g| \sim 1$.
For the same reason, we should not discard galaxies with
$|\epsilon^{\rm m}| > 1$ without properly accounting for their removal
in Eq. (21). Therefore, unless otherwise stated, we will
use the probability distribution for measured ellipticity
given in Eq. (21). It still does not account for the
measurement errors on the redshifts of background sources;
however, these are less important.
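
To make the structure of the log-likelihood (23) concrete, the sketch below evaluates it in the small-$|g|$ approximation described above, i.e. using $p_\epsilon$ of Eq. (18) with the measurement error folded into a single dispersion, rather than the full convolution of Eq. (21). This is exactly the approximation that the text warns becomes biased when many sources have $|g| \sim 1$, so it is meant only as an illustration for a clearly undercritical sheet of constant $\kappa$ and $\gamma$. All function names, the seed, and the parameter values are our own assumptions.

```python
import math
import random

def z_weight(z, z_d=0.2):
    """Einstein-de Sitter cosmological weight, Eq. (4)."""
    if z <= z_d:
        return 0.0
    return (math.sqrt(1 + z) - math.sqrt(1 + z_d)) / (math.sqrt(1 + z) - 1)

def reduced_shear(kappa, gamma, Z):
    """Eq. (5) for constant kappa and complex shear gamma."""
    return Z * gamma / (1.0 - Z * kappa)

def log_p_eps(eps, g, sigma):
    """ln p_eps(eps | g) from Eqs. (17)-(19); only valid for |g| < 1."""
    eps_s = (eps - g) / (1.0 - g.conjugate() * eps)
    jac = (1.0 - abs(g) ** 2) ** 2 / abs(1.0 - g.conjugate() * eps) ** 4
    norm = 2.0 * math.pi * sigma ** 2 * (1.0 - math.exp(-0.5 / sigma ** 2))
    return -abs(eps_s) ** 2 / (2.0 * sigma ** 2) - math.log(norm) + math.log(jac)

def neg_log_likelihood(kappa, gamma, catalogue, sigma_s, sigma_err):
    """Eq. (23) in the small-|g| approximation (errors folded into sigma)."""
    sigma = math.hypot(sigma_s, sigma_err)
    return -sum(log_p_eps(eps_m, reduced_shear(kappa, gamma, Z), sigma)
                for eps_m, Z in catalogue)

def mock_catalogue(kappa, gamma, n, sigma_s, sigma_err, rng):
    """Toy catalogue of (measured ellipticity, cosmological weight) pairs."""
    cat = []
    for _ in range(n):
        Z = z_weight(rng.gammavariate(3.0, 2.0 / 3.0))   # redshifts from Eq. (28)
        g = reduced_shear(kappa, gamma, Z)
        while True:  # truncated Gaussian of Eq. (17): reject |eps_s| >= 1
            eps_s = complex(rng.gauss(0, sigma_s), rng.gauss(0, sigma_s))
            if abs(eps_s) < 1.0:
                break
        eps = (eps_s + g) / (1.0 + g.conjugate() * eps_s)  # inverse of Eq. (7)
        eps_m = eps + complex(rng.gauss(0, sigma_err), rng.gauss(0, sigma_err))
        cat.append((eps_m, Z))
    return cat

rng = random.Random(1)   # fixed seed, for reproducibility only
cat = mock_catalogue(0.3, 0.1 + 0.1j, 500, 0.15, 0.1, rng)
l_true = neg_log_likelihood(0.3, 0.1 + 0.1j, cat, 0.15, 0.1)
l_off = neg_log_likelihood(0.3, 0.25 + 0.25j, cat, 0.15, 0.1)
print(l_true, l_off)
```

The value at the input parameters is lower than at the offset shear, as expected; a minimiser would be run over `kappa` and `gamma` in place of the two fixed evaluations.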



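3.2. $\chi^2$ approach

Finally, we note an alternative to the maximum-likelihood
approach, namely a chi-square fitting. In our case, the $\chi^2$
function can be written as

$$\chi^2(\pi) = \sum_{i=1}^{N_{\rm g}} \frac{\left|\epsilon_i^{\rm m} - \langle \epsilon \rangle\right|^2}{\sigma_i^2} , \tag{24}$$

where

$$\sigma_i^2 = \left(1 - |\langle \epsilon \rangle|^2\right)^2 \sigma_{\epsilon^{\rm s}}^2 + \sigma_{\rm err}^2 . \tag{25}$$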

Unfortunately, simulations show that such an approach
does not give an unbiased result for critical lenses. This
is due to two main reasons. First, the expression (25) is
only an approximation for the true variance, which
contains higher (even) order moments of the distribution of
$\epsilon^{\rm s}$ (Lombardi 2000). Second and more important,
ellipticity errors are not normally distributed for lenses with large
reduced shears, $|g| \sim 1$ (see Geiger & Schneider 1998,
Fig. 5). In the maximum likelihood approach, on the other
hand, the real distribution of errors is properly accounted
for and in the asymptotic limit the method will therefore
give an unbiased result. Hence, in the following we will
only discuss the likelihood approach.

4. The mass-sheet degeneracy 

From the discussion above, it is apparent that only
galaxies with $|g| \sim 1$ contribute significantly to the removal of
the mass-sheet degeneracy, for two reasons:

- The dispersion on the observed ellipticities is
  proportional to $(1 - |\langle \epsilon \rangle|^2)$ [cf. Eq. (25)], and so galaxies
  with $|g| \sim 1$ provide more accurate estimates of the
  local reduced shear;

- Recalling the argument discussed after Eq. (15),
  galaxies with $|g| \ll 1$ present a simple mass-sheet
  degeneracy [cf. Eq. (15)] which cannot be broken using the
  method described here. Therefore one needs the
  information from $|g| \sim 1$ galaxies to break the degeneracy.

It is important to realize that the likelihood function, in
the way we wrote it, already takes into account the first
point mentioned above. As a result, galaxies with $|g| \sim 1$
are effectively more important for the purpose of
breaking the mass-sheet degeneracy, while those with $|g| \ll 1$
do not contribute at all (as sketched in Sect. 2.2). It is
sensible, therefore, to take into account this point when
writing the approximate form of the mass-sheet
degeneracy presented in Eq. (12). In other words, we assume the
following transformation of $\langle Z^n(z) \rangle$:

$$\langle Z^n(z) \rangle \to \langle Z^n(z) \rangle' = \frac{\sum_{i=1}^{N_{\rm g}} w_i Z_i^n}{\sum_{i=1}^{N_{\rm g}} w_i} , \tag{26}$$

where $w_i$ are the weight factors; for $w_i = \text{const.}$ we
recover the results of Seitz & Schneider (1997). To give
more weight to galaxies with $|g| \sim 1$, we will however
use $w_i = 1/\sigma_i$, defined in (25). As we mentioned before,
this is only an approximation for the true dispersion;
however, as we will show later, it is adequate to describe the
approximate mass-sheet degeneracy behaviour.
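
The weighted moments of Eq. (26) and the resulting degeneracy line of Eq. (12) are simple to evaluate. The sketch below is an illustration only: the function names are ours, the three cosmological weights are hypothetical, and the form used for $\sigma_i$ is an assumption, namely $\sigma_i^2 = (1 - |\langle\epsilon\rangle|^2)^2 \sigma_{\epsilon^{\rm s}}^2 + \sigma_{\rm err}^2$, chosen to be consistent with the statement that the dispersion is proportional to $(1 - |\langle\epsilon\rangle|^2)$.

```python
import math

def expected_ellipticity(g):
    """Eq. (8): <eps> = g for |g| < 1, and 1/g* otherwise."""
    return g if abs(g) < 1.0 else 1.0 / g.conjugate()

def sigma_i(g, sigma_s, sigma_err):
    """Assumed dispersion: sqrt((1 - |<eps>|^2)^2 sigma_s^2 + sigma_err^2)."""
    mean_eps = expected_ellipticity(g)
    return math.sqrt((1.0 - abs(mean_eps) ** 2) ** 2 * sigma_s ** 2 + sigma_err ** 2)

def weighted_moment(Z, w, n):
    """Eq. (26): weighted n-th moment of the cosmological weights."""
    return sum(wi * Zi ** n for wi, Zi in zip(w, Z)) / sum(w)

def degenerate_kappa(kappa, lam, Z, w):
    """Eq. (12) evaluated with the weighted moments of Eq. (26)."""
    return lam * kappa + (1.0 - lam) * weighted_moment(Z, w, 1) / weighted_moment(Z, w, 2)

# Hypothetical example: three sources behind a sheet with kappa = 1, gamma = 0.1 + 0.1i.
Z = [0.2, 0.5, 0.8]
kappa, gamma = 1.0, 0.1 + 0.1j
g = [Zi * gamma / (1.0 - Zi * kappa) for Zi in Z]
w_const = [1.0] * len(Z)
w_sigma = [1.0 / sigma_i(gi, 0.15, 0.1) for gi in g]

for lam in (0.8, 1.0, 1.2):
    print(lam, degenerate_kappa(kappa, lam, Z, w_const),
          degenerate_kappa(kappa, lam, Z, w_sigma))
```

Sources with $|g|$ closer to unity receive a smaller $\sigma_i$ and hence a larger weight, which shifts the expected degeneracy line relative to the constant-weight case.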

5. Simulated data 

In order to test whether we can break the mass-sheet de- 
generacy by using redshift information on the background 
sources, we performed two simple tests on simulated data. 



We considered first the simplest model possible,
a constant sheet of mass with external shear (Family I).
More precisely, we set $\kappa(\theta) = C_1$, $\gamma_1(\theta) = C_2$, and $\gamma_2(\theta) =
C_3$, with $C_i$ constants. Thus, in this model the convergence
and shear for each galaxy depend only on the redshift of
the source and not on its position. Although, clearly, this
model is completely unrealistic, it is very useful as a test
for our method. Indeed, it is reasonable to assume that if
we are not able to break the mass-sheet degeneracy in this
simple and favorable situation, it is unlikely that we will
be able to break it in more realistic and complicated cases.
This simple model can also be used as an indicator of how
strong the lens should be in order to obtain a reliable
estimate for the average surface mass density $\kappa$ across the
field.

The second family we used is a non-singular model that
approximates an isothermal sphere for large distances (see
Schneider et al. 1992), in which we allow for a constant
sheet in surface mass density (hereafter Family II). The
dimensionless surface mass density is given by

$$\kappa(\theta/\theta_{\rm c}) = \kappa_0\,\frac{1 + \theta^2/(2\theta_{\rm c}^2)}{\left(1 + \theta^2/\theta_{\rm c}^2\right)^{3/2}} + \kappa_1 , \tag{27}$$

where $\kappa_0$, $\kappa_1$ are dimensionless constants and $\theta_{\rm c}$ is the core
radius.

We used the following recipe for generating mock cat- 
alogues of background sources: 

1. 2000 galaxies were drawn randomly across the
field of $6 \times 6$ arcmin$^2$, thus giving a density of
55 galaxies arcmin$^{-2}$.

2. The intrinsic ellipticities $\epsilon^{\rm s}$ were drawn according to
Eq. (17), characterized by $\sigma = \sigma_{\epsilon^{\rm s}} = 0.15$.

3. We drew the redshifts of the background sources,
following Brainerd et al. (1996), from a gamma
distribution

$$p_z(z) = \frac{z^2}{2 z_0^3}\,\exp(-z/z_0) , \tag{28}$$

with $z_0 = 2/3$; the mean redshift is $\langle z \rangle = 3 z_0 = 2$,
and the mode is $z_{\rm mode} = 2 z_0 = 4/3$. The corresponding
cosmological weights were evaluated assuming an
Einstein-de Sitter cosmology. We put the lens at a
redshift $z_{\rm d} = 0.2$. For the purpose of mass-sheet
degeneracy breaking one might think it would be an
advantage to have a higher redshift lens, e.g. $z_{\rm d} = 0.4$. In
such a case most of the galaxies would lie at the steep
part of the function $Z(z)$ and therefore we would have
a higher scatter of $Z_i$ values. However, this effect is
compensated by the fact that the average reduced shear
is lower for a lens at higher redshift. The effect of the
change in the number of background sources further favours
the low redshift cluster, though not significantly.
4. For each galaxy, we evaluated the local shear according 
to the lens model, its position and redshift; then we 
lensed the galaxy ellipticities. Note that the positions 
of the source galaxies were not transformed, i.e. we 
neglected here the magnification effect of the lens. 

Fig. 1. Top: Recovered parameter values (crosses) as a result of minimising the log-likelihood function (23). For each
of the ten sets of parameters (panel (a) $\gamma_1 = \gamma_2 = 0.1$, panel (b) $\gamma_1 = \gamma_2 = 0.2$), denoted by squares, 100 mock
catalogues were created using model Family I and the same family was used to fit the data. We use 3 free parameters
($\kappa$, $\gamma_1$, and $\gamma_2$) for fitting; here we plot $\kappa$ and $|\gamma|$. Lines correspond to the expected mass-sheet degeneracy calculated
using the weighting scheme given by $w_i = 1/\sigma_i$ (solid lines) and $w_i = \text{const.}$ (dashed lines) in (26). Bottom: Bias and
variance for the recovered values of $\kappa$ for the data described above.



5. The measurement error $\epsilon^{\rm err}$ on the observed
ellipticities was drawn from the distribution (17) with $\sigma =
\sigma_{\rm err} = 0.1$ and added to the lensed ellipticities.

6. In most cases we considered measurement errors on
the redshifts of the galaxies to simulate the use of
photometric redshifts. These errors were drawn from
a Gaussian distribution with $\sigma_{z{\rm err}} = 0.06\,(1 + z_i)$ (see
Benítez 2000); in adding the errors we ensured that
the resulting redshifts are always positive.
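
The recipe for the mock catalogues (steps 1 to 6) can be sketched in a few lines for Family I. This is a minimal illustration and not the code used for the simulations: the function names, the dictionary layout of a catalogue entry and the seed are ours, and positive redshifts are enforced here by redrawing the redshift error, which is an assumption since the text only states that the resulting redshifts are positive.

```python
import math
import random

Z_D = 0.2          # lens redshift
Z0 = 2.0 / 3.0     # scale of the redshift distribution, Eq. (28)

def z_weight(z, z_d=Z_D):
    """Einstein-de Sitter cosmological weight, Eq. (4)."""
    if z <= z_d:
        return 0.0
    return (math.sqrt(1 + z) - math.sqrt(1 + z_d)) / (math.sqrt(1 + z) - 1)

def truncated_gaussian(sigma, rng):
    """Complex ellipticity from Eq. (17): 2-D Gaussian truncated at |eps| < 1."""
    while True:
        eps = complex(rng.gauss(0.0, sigma), rng.gauss(0.0, sigma))
        if abs(eps) < 1.0:
            return eps

def lens(eps_s, g):
    """Inverse of Eq. (7): image ellipticity from source ellipticity."""
    if abs(g) < 1.0:
        return (eps_s + g) / (1.0 + g.conjugate() * eps_s)
    return (1.0 + g * eps_s.conjugate()) / (eps_s.conjugate() + g.conjugate())

def mock_catalogue_family1(kappa, gamma, n_gal=2000, field=6.0,
                           sigma_s=0.15, sigma_err=0.1, sigma_z=0.06, seed=0):
    """Family I mock catalogue: constant kappa and complex shear gamma."""
    rng = random.Random(seed)
    catalogue = []
    for _ in range(n_gal):
        x, y = rng.uniform(0.0, field), rng.uniform(0.0, field)  # step 1 (arcmin)
        eps_s = truncated_gaussian(sigma_s, rng)                  # step 2
        z = rng.gammavariate(3.0, Z0)                             # step 3, Eq. (28)
        Z = z_weight(z)
        g = Z * gamma / (1.0 - Z * kappa)                         # step 4, Eq. (5)
        eps_m = lens(eps_s, g) + truncated_gaussian(sigma_err, rng)  # step 5
        z_obs = -1.0
        while z_obs <= 0.0:   # step 6; redrawing is our assumption
            z_obs = rng.gauss(z, sigma_z * (1.0 + z))
        catalogue.append({"x": x, "y": y, "z": z_obs, "eps": eps_m})
    return catalogue

catalogue = mock_catalogue_family1(kappa=1.0, gamma=0.1 + 0.1j)
print(len(catalogue), catalogue[0])
```

Positions are not transformed by the lens, in line with step 4. For Family II only the line computing `g` would change, with $\kappa$ and $\gamma$ evaluated at the galaxy position.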



In generating the mock catalogues we have tried to
simulate an ideal case, since we are trying to answer the
question of when (and if at all) the mass-sheet degeneracy
can be broken in statistical lensing mass reconstructions.
From a comparison between independent observations of
the same galaxies in the COMBO-17 survey, $\sigma_{\rm err} \sim 0.1 - 0.15$
(for each component of the ellipticity) can be estimated
(Kleinheinrich 2003). The typical values of $\sigma_{\epsilon^{\rm s}}$ are
optimistically $\sim 0.2$. However, even a less optimistic case
($\sigma_{\epsilon^{\rm s}} = 0.3$, $\sigma_{\rm err} = 0.15$) does not change our conclusions.

Fig. 2. Recovered parameter values (crosses) as a result of minimising the log-likelihood function Eq. (23). Panel (a)
shows results obtained using $p_\epsilon(\epsilon \mid g)$, Eq. (18), for the observed probability distribution for ellipticities, while for
panel (b) we use (as in the rest of this paper) $p_{\epsilon^{\rm m}}(\epsilon^{\rm m} \mid g)$, Eq. (21). For the parameter set $\kappa = 1.0$, $\gamma_1 = \gamma_2 = 0.1$,
denoted by a square, 100 mock catalogues were created using model Family I and the same family was used to
fit the data. Redshift errors were not added. We use 3 free parameters ($\kappa$, $\gamma_1$, and $\gamma_2$) for fitting; here we plot $\kappa$ and
$|\gamma|$. Lines correspond to the expected mass-sheet degeneracy calculated using Eq. (12). We used the weighting scheme
given by $w_i = \text{const.}$ for the dashed lines. For the solid lines $w_i = 1/\sigma_i$ was used; however, $\sigma_{\rm err} = 0$ was employed for
panel (a) only. This is due to the fact that we are using $p_\epsilon(\epsilon \mid g)$ in (a) for the observed probability distribution for
ellipticities, rather than $p_{\epsilon^{\rm m}}(\epsilon^{\rm m} \mid g)$.



5.1. Results of the model fitting to the simulated data 

For each set of true model parameters $\pi_{\rm t}$ we
generated 100 mock catalogues. Using these, we searched for
the most likely parameters $\pi_{\rm max}$ by minimising the
log-likelihood function (23). We performed the minimisation
with C-minuit (James & Roos 1975), a routine which is
part of the CERN Program Library. C-minuit is designed
to minimise a multi-parameter function and analyse its
shape around the minimum. It makes simultaneous use
of the gradient as well as the downhill simplex method.

Despite the robustness of the routine, one has to be very
careful when performing the minimisation, as the function
(23) has logarithmic singularities for $|g| = 1$ when $\sigma_{\rm err} = 0$
is assumed. If, for a particular parameter set, a
background galaxy happens to have $|g| \sim 1$, the log-likelihood
function (23) diverges and the minimisation procedure has
difficulty "climbing" over such a region, possibly leading to
secondary minima.

In the case of $\sigma_{\rm err} \neq 0$ this is less likely to happen,
as the convolution with $p_{\epsilon^{\rm err}}$ in (21) is effectively
smoothing the probability distribution. Unfortunately the
integration in (21) has to be performed numerically. The noise
resulting from numerical calculations is therefore present
in the log-likelihood function and the minimisation
routine is not able to search for a minimum, since the
function is not smooth. On top of that, the integrations are
CPU consuming, since one needs to perform them for every
galaxy separately. In order to avoid these problems, we
note that the function $p_{\epsilon^{\rm m}}$ (21) depends on three parameters
(without loss of generality we can assume $g$ to be real
and accordingly transform $\epsilon^{\rm m}$). We therefore evaluate $p_{\epsilon^{\rm m}}$
on a three-dimensional grid (with the GNU Scientific Library
qags routine, http://www.gnu.org/software/gsl) and
use tri-linear interpolation to evaluate the log-likelihood.

Finally, we stress that the model fitted to the data
belonged, for all simulations, to the same family as the
original model used to generate the galaxy catalog. Although
unrealistic, this assumption allows us to directly compare
the deduced parameters with the original ones, and thus
simplifies the evaluation and interpretation of the
simulation results. The free parameters of both models are
$\pi = (\kappa, \gamma_1, \gamma_2)$ for Family I, $\pi = (\kappa_0, \kappa_1)$ for Family II. These
correspond to the mass-sheet degeneracy transformation
for the case where redshift information is not known:
$\kappa(\theta) \to \kappa'(\theta) = \lambda\kappa(\theta) + (1 - \lambda)$ and $\gamma(\theta) \to \gamma'(\theta) =
\lambda\gamma(\theta)$. By investigating the distribution of best fit
parameters $\pi$ for both models, we can answer the question of
how strong the lens needs to be in order to lift the
above-mentioned degeneracy under idealised conditions.

5.1.1. Family I 

Mock catalogues for Family I were generated using $\gamma_1 =
\gamma_2 = 0.1$ (see Fig. 1, panel a) and $\gamma_1 = \gamma_2 = 0.2$ (panel
b); 10 different values of $\kappa$ ranging from 0.5 to 1.5 were
chosen for both sets. For each model we fitted 100 mock
catalogues (generated as described above) and we marked
with crosses the resulting best fit parameters (the original
values of the parameters are represented as squares). In
order to explore the importance of redshift measurement
errors we have generated catalogues without and with
redshift errors as described above (in Fig. 1 we show only the
latter).

Fig. 3. (a) Recovered parameter values (crosses) as a result of minimising the log-likelihood function Eq. (23). For
each of the four sets of parameters (denoted by squares) 100 mock catalogues were created. We use model Family
II and the same family was used to fit the data. The model has 2 free parameters ($\kappa_0$ and $\kappa_1$), see Eq. (27). Solid
and dashed lines correspond to the expected mass-sheet degeneracy calculated using $w_i = 1/\sigma_i$ and $w_i = \text{const.}$ in
Eq. (26) for the weighting scheme, respectively (both almost overlap). Dot-dashed lines are given by $\kappa_0 + \kappa_1 = \text{const.}$
(b) Relative bias and variance for the recovered values of $\kappa_1$.

Surprisingly, even for a relatively strong lens with $\kappa = 0.5$ we are not
able to break the mass-sheet degeneracy with good accuracy. For this model,
the relative error on $\kappa$ is of the order of 20%-40%, which should be
considered very large, given the idealised conditions used in our simulations.
Hence, the results of our simulations suggest that it is very difficult to
break the mass-sheet degeneracy for non-critical lenses using shape
information only. As expected, for data without redshift errors the fit
slightly improves, though not substantially.

On the other hand, for critical and close-to-critical lenses, we are
effectively able to break the mass-sheet degeneracy. The surface mass density
is well constrained for $\kappa > 1$. As expected, the results show that
galaxies with $|g| \sim 1$ contribute substantially to the removal of the
degeneracy. If the redshift uncertainty is not added, the fit improves and the
constraints are tighter.

Finally, it is interesting to observe that in some cases (e.g. $\kappa = 1$
and $|\gamma| = 0.14$) we obtain biased results if the log-likelihood is
calculated with the approximate ellipticity distribution (see Fig. 2). Namely,
in that figure we use the likelihood function that implies first convolving
the data with the measurement-error distribution and only then performing the
lensing transformation [see discussion after Eq. (23)]. This can severely bias
results for lenses where many galaxies have $|g| \sim 1$. Note that for this
case the expected mass-sheet degeneracy calculated using $w_i = 1/\sigma_i$ in
Eq. (26) also needs to be calculated with $\sigma_{\rm err} = 0$, as we are
not correctly accounting for the measurement errors in this case.

In addition to the global mass-sheet invariance, there is a local point
invariance. Namely, looking at a single image ellipticity we cannot
distinguish between $g$ and $1/g^*$ (Schneider & Seitz 1995). This invariance
is in practice easily broken, either by assuming a profile or simply by
stating that $\kappa$ increases towards the cluster centre. In the case of all
sources being at the same redshift, however, this is a global invariance for
Family I, since $\kappa = {\rm const}$. In such a case the log-likelihood
function has two equivalent minima, one corresponding to $g$ and one to
$1/g^*$. However, since we do have the redshift information, this invariance
is broken in most cases.
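
The local invariance can be illustrated numerically. The sketch below assumes the standard transformation law for the complex ellipticity $\epsilon$ (with separate branches for $|g| \le 1$ and $|g| > 1$), which is not restated in this section, and averages the image ellipticities of a ring of isotropically oriented sources. The function names and the chosen values of $g$ and $|\epsilon^{\rm s}|$ are hypothetical.

```python
import cmath
import math


def lensed_ellipticity(eps_s: complex, g: complex) -> complex:
    """Map a source ellipticity to the image ellipticity for reduced shear g."""
    if abs(g) <= 1.0:
        return (eps_s + g) / (1.0 + g.conjugate() * eps_s)
    return (1.0 + g * eps_s.conjugate()) / (eps_s.conjugate() + g.conjugate())


def mean_image_ellipticity(g: complex, r: float = 0.3, n: int = 64) -> complex:
    """Average image ellipticity of n sources with |eps_s| = r and evenly spaced phases."""
    sources = [cmath.rect(r, 2.0 * math.pi * k / n) for k in range(n)]
    return sum(lensed_ellipticity(e, g) for e in sources) / n


g = 0.5 + 0.0j
g_dual = 1.0 / g.conjugate()           # the second solution, here |g_dual| = 2
print(mean_image_ellipticity(g))       # close to g
print(mean_image_ellipticity(g_dual))  # also close to g, not to g_dual
```

Both reduced shears give the same mean image ellipticity, so a single-redshift population of isotropic sources cannot tell them apart; this is the invariance that the redshift information breaks.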

For our special case with $\kappa = 1$ and $|\gamma| = 0.14$ this is not so
simple. The average absolute value of the reduced shear for the galaxies is
$\langle |g| \rangle \sim 1$, and therefore for most galaxies the two minima
lie close together. If the probability distribution of lensed ellipticities is
properly accounted for, the resulting solutions give an unbiased estimate for
the true minimum (see Fig. 3), whereas if this is not the case, as in Fig. 2,
the results can be biased.

This special case was presented as an extreme example. In practice such a lens
is of course unrealistic. However, we want to stress that critical lenses need
to be treated with caution when using statistical lensing.

5.1.2. Family II

Mock catalogues for Family II were generated using $\kappa_0 = 0.9$,
$\theta_c = 1.5$ arcmin, and 4 different values of $\kappa_1$ ranging from 0.2
to 0.5. We fitted these data with the same family of models, using $\kappa_0$
and $\kappa_1$ as free parameters; note that we fixed the core radius to the
same value as in the original profile. Figure 3a shows the results of the
log-likelihood minimisation for the data with added redshift errors. Solid and
dashed lines in the figure correspond to the expected mass-sheet degeneracy
calculated using the weighting $w_i = 1/\sigma_i$ and $w_i = {\rm const.}$ in
Eq. (26), respectively; dot-dashed lines give
$\kappa_0 + \kappa_1 = {\rm const}$.

Finally, we plot the bias and the variance of the recovered values of
$\kappa_1$ in Fig. 3b. When the lens becomes critical, the errors are small
enough that we are able to constrain $\kappa_1$ with high accuracy. For these
lenses we are therefore effectively able to break the mass-sheet degeneracy.
This result is in accordance with the conclusions from the constant-lens
model.

Surprisingly, for the case $\kappa_0 = 0.9$ and $\kappa_1 = 0.2$ we do not see
the expected mass-sheet degeneracy; rather, the best-fit parameters lie along
the line $\kappa_0 + \kappa_1 = {\rm const}$. This is likely due to the fact
that for this model we have galaxies with $|g| \sim 1$ close to the centre,
where the surface mass density is $\kappa(0) = \kappa_0 + \kappa_1$. Since the
galaxies with $|g| \sim 1$ are the ones that contribute most to breaking the
mass-sheet degeneracy, it is not surprising to see a degeneracy along the line
$\kappa_0 + \kappa_1 = {\rm const}$ for this model.

If we take $\theta_c$ to be a free parameter in the fitting, we obtain the
degeneracy $\kappa_0 + \kappa_1 = {\rm const}$ for all four sets of model
parameters $\pi_{\rm t}$. Since $\theta_c$ is allowed to vary, it can adjust
so that the resulting best-fit model has, for lower values of $\kappa_0$,
lower values of $\theta_c$. The region of $|g| \sim 1$ is then approximately
unchanged as long as
$\kappa_0 + \kappa_1 = \kappa_{0{\rm t}} + \kappa_{1{\rm t}}$.

5.2. Ensemble-averaged log-likelihood

Using the log-likelihood function (23) we can in principle obtain the
best-fitting parameters $\pi_{\rm max}$ given the observations, as well as the
confidence regions on these parameters. In order to obtain the expected errors
of parameters obtained from a single realisation of the data, one can
calculate the ensemble-averaged log-likelihood. Ensemble-averaging also
provides a useful test of the behaviour of the log-likelihood function in the
asymptotic limit. Given the redshift distribution of the background sources
$p_z(z)$ and following Schneider et al. (2000), we write down the ensemble
average in the general form as

$$\langle l \rangle(\pi) = n_g \int d^2\theta \int d^2\epsilon \; p_\epsilon(\epsilon|g) \int dz \; p_z(z) \, l(\pi) , \qquad (29)$$

where $n_g$ gives the number of galaxies per unit area, and
$p_\epsilon(\epsilon|g)$ is the probability distribution of lensed
ellipticities calculated using the lens parameters $\pi_{\rm t}$.

Asymptotically (i.e. when the number of source galaxies $N_g$ is very large),
the quantity $2\Delta l$, where

$$\Delta l = \langle l \rangle(\pi) - \langle l \rangle(\pi_{\rm t}) , \qquad (30)$$

behaves as a random variable following a $\chi^2_M$ distribution, where $M$ is
the number of free parameters. In particular, for Family I, where $M = 3$, we
expect 68.3%, 90%, 95.4%, and 99% of the points within the levels
$2\Delta l = \{3.53, 6.25, 8.02, 11.3\}$, while for Family II ($M = 2$) the
corresponding levels are $2\Delta l = \{2.30, 4.61, 6.17, 9.21\}$. It is
interesting to test the behaviour of this quantity and the accuracy of the
asymptotic limit in our case.
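
The quoted levels can be cross-checked with a short calculation. The following sketch uses only the Python standard library: it evaluates the $\chi^2_M$ cumulative distribution through the series expansion of the regularised incomplete gamma function and inverts it by bisection. The function names are illustrative and this is not the code used for the paper.

```python
import math


def chi2_cdf(x: float, dof: int) -> float:
    """Chi-square CDF via the series for the regularised incomplete gamma function."""
    if x <= 0.0:
        return 0.0
    a, y = 0.5 * dof, 0.5 * x
    term = total = 1.0 / a
    n = 0
    while abs(term) > 1e-16 * abs(total):
        n += 1
        term *= y / (a + n)
        total += term
    return total * math.exp(-y + a * math.log(y) - math.lgamma(a))


def chi2_level(prob: float, dof: int) -> float:
    """Level of 2*Delta_l enclosing the fraction `prob` of a chi-square variable (bisection)."""
    lo, hi = 0.0, 100.0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if chi2_cdf(mid, dof) < prob:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


for dof in (3, 2):  # Family I (M = 3) and Family II (M = 2)
    print(dof, [round(chi2_level(p, dof), 2) for p in (0.683, 0.90, 0.954, 0.99)])
```

The second decimal of the 68.3% and 95.4% levels depends on how the one- and two-sigma probabilities are rounded, so small differences from tabulated values are expected there.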

Note that, as shown by Geiger & Schneider (1998), the ellipticity distribution
is generally skewed even for relatively small ($|g| = 0.6$) reduced shears;
hence, we need to evaluate all integrations of Eq. (29), including the one
over $\epsilon$, numerically. The integration has been carried out using the
GNU Scientific Library routines.

In Fig. 4 we show an example of the ensemble-averaged log-likelihood for one
set of simulated parameters for model Family II; the other sets give similar
results. We simulated the data using parameters $\kappa_0 = 0.9$ and
$\kappa_1 = 0.2$ without adding the redshift errors. This is the extreme case
mentioned above, where the distribution of recovered parameters lies in a very
narrow valley along the line $\kappa_0 + \kappa_1 = {\rm const}$ (dash-dotted
line) rather than along the expected mass-sheet degeneracy line (solid and
dashed lines for the different weighting schemes). The ensemble-averaged
log-likelihood calculations confirm the anomalous behaviour of the degeneracy.

The $2\Delta l = 9.21$ contour in Fig. 4 should enclose 99% of all points.
This is not satisfied in our case, mainly because we are not in the asymptotic
regime (as seen from the fact that the points in the plot do not follow a
Gaussian distribution).

6. Conclusions 

In this paper we considered a new method to break the 
mass-sheet degeneracy in weak lensing mass reconstruc- 
tions using shape measurements only. A detailed analy- 
sis of this method has also clearly shown that breaking 
the mass-sheet degeneracy is very difficult even in optimal 
conditions; arguably, it is extremely difficult in normal ob- 
servational conditions. 

Our main conclusions are summarized in the following 
items: 

1. The mass-sheet degeneracy can be broken by using red- 
shift information of the individual sources. However, 
this is effective for critical clusters only, i.e. for clus- 
ters that have sizable regions where multiple imaging 
is possible (and thus perhaps observed). The statisti- 
cal lensing analysis has to be extended close to and 
inside the critical curves of the cluster. In the regions 
far outside the critical curve, where weak lensing mass 
reconstructions are normally performed, the lens is too 
weak for the mass-sheet degeneracy to be broken by us- 
ing redshift and distortion information only, even when 
idealised conditions are employed. 

2. Using simulations we find that the correlation left over from the
mass-sheet degeneracy transformation for critical lenses is well described by
Eq. (12), $\kappa \to \kappa' \simeq \lambda \kappa + (1 - \lambda) \langle Z(z) \rangle / \langle Z^2(z) \rangle$,
where the moments of the cosmological weights are calculated using
$w_i = 1/\sigma_i$ in Eq. (26).

3. In order to break the mass-sheet degeneracy with current data it is
necessary to extend the statistical lensing analysis closer to the cluster
centre and to simultaneously perform weak and strong lensing analysis of the
cluster. This will be the subject of a future study.

Fig. 4. Ensemble-averaged log-likelihood for model Family II with
$\kappa_0 = 0.9$ and $\kappa_1 = 0.2$ (square). The contour gives the
difference of ensemble-averaged log-likelihoods at a level of
$2\Delta l = 9.21$. In contrast to the previous figures, 1000 recovered model
parameters are plotted as dots, and $\kappa_0 + \kappa_1$ vs. $\kappa_1$ is
plotted (note the small range on the y axis). The solid line gives the
expected mass-sheet degeneracy line calculated with $w_i = 1/\sigma_i$ in
Eq. (26), and the dashed line is calculated with $w_i = {\rm const.}$ (both
almost overlapping). The dash-dotted line is given by
$\kappa_0 + \kappa_1 = {\rm const}$.
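
The approximate transformation quoted in item 2 can be sketched as follows. The weighted-moment definition $\langle Z^n \rangle = \sum_i w_i Z_i^n / \sum_i w_i$ is an assumed form of Eq. (26), which is not restated in this section, and the cosmological weights in the example are hypothetical values chosen only for illustration.

```python
def weighted_moment(z_weights, w, power):
    """Weighted moment <Z^power> = sum(w_i * Z_i^power) / sum(w_i) (assumed form of Eq. 26)."""
    return sum(wi * zi**power for zi, wi in zip(z_weights, w)) / sum(w)


def degenerate_kappa(kappa, lam, z_weights, w=None):
    """Approximate transformation kappa -> lam*kappa + (1 - lam) * <Z> / <Z^2>."""
    if w is None:
        w = [1.0] * len(z_weights)  # the w_i = const weighting scheme
    ratio = weighted_moment(z_weights, w, 1) / weighted_moment(z_weights, w, 2)
    return lam * kappa + (1.0 - lam) * ratio


# Hypothetical cosmological weights Z(z_i) of five background galaxies
Z = [0.6, 0.8, 1.0, 1.1, 1.2]
print(degenerate_kappa(0.9, 0.8, Z))

# All sources at one redshift with Z = 1: the classical transformation is recovered
print(degenerate_kappa(0.9, 0.8, [1.0, 1.0, 1.0]))
```

Passing a list of per-galaxy weights as `w` would correspond to the other weighting scheme discussed in the text; with a single source redshift the ratio of moments is unity and the expression reduces to $\lambda\kappa + (1 - \lambda)$.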

We are developing a method which combines weak and strong lensing mass
reconstruction techniques simultaneously in a non-parametric fashion. Weak and
strong lensing data have previously been combined by e.g. Kneib et al. (2003);
however, only the combination of the weak lensing signal on scales $> 500$ kpc
with the strong lensing signal at $\sim 100$ kpc was taken into account. Our
approach to breaking the mass-sheet degeneracy can also be employed for
non-parametric mass reconstruction. The non-parametric statistical lensing
reconstruction technique needs to be extended to and within the critical
regions, which would break the mass-sheet degeneracy in practice. Such a
method does not rely on any assumption about the parametric form of the
potential.

In summary, although breaking the mass-sheet degeneracy has proven to be
surprisingly difficult, we have shown that it is in principle possible if one
combines constraints on different scales (note that we assumed knowledge of
the cluster-mass profile; in practice one obtains the profile by using
standard weak-lensing mass reconstructions). The mass-sheet degeneracy is
probably the most severe limit of current weak lensing mass reconstructions,
and is generally responsible for a significant fraction of the final error on
the "total" mass of the cluster. Hence, breaking the mass-sheet degeneracy in
practice is one of the most important challenges of weak lensing studies of
clusters in the near future.